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The secretory pathway of Pichia pastoris was genetically re-engi- 
neered to perform sequential glycosylation reactions that mimic early 
processing of N-glycans in humans and other higher mammals. After 
eliminating nonhuman glycosylation by deleting the initiating a-^,S- 
mannosyltransferase gene from P. pastoris, several combinatorial 
genetic libraries were constructed to localize active a-1,2-mannosi- 
dase and human |3-1,2-A/-acetylglucosaminyltransf erase I (GnTI) in the 
secretory pathway. First >32 N-terminal leader sequences of fungal 
type II membrane proteins were cloned to generate a leader library. 
Two additional libraries encoding catalytic domains of a-1,2-manno- 
sidases and GnTI from mammals, insects, amphibians, worms, and 
fungi were cloned to generate catalytic domain libraries. In-frame 
fusions of the respective leader and catalytic domain libraries resulted 
in several hundred chimeric fusions of fungal targeting domains and 
catalytic domains. Although the majority of strains transformed with 
the mannosidase/leader library displayed only modest in vivo [i.e., 
low levels of man nose (Man)5-(GlcNAc)2] activity, we were able to 
isolate several yeast strains that produce almost homogenous N- 
glycans of the (Man)5-(GlcNAc)2 type. Transformation of these strains 
with a UDP-GlcNAc transporter and screening of a GnTI leader fusion 
library allowed for the isolation of strains that produce GlcNAc- 
(Man)5-(GlcNAc)2 in high yield. Recombinant expression of a human 
reporter protein in these engineered strains led to the formation of 
a glycoprotein with GlcNAc-(Man}5-(GlcNAc)2 as the primary N-glycan. 
Here we report a yeast able to synthesize hybrid glycans in high yield 
and open the door for engineering yeast to perform complex human- 
like glycosylation. 

The number of protein-based therapeutics entering preclinical 
and clinical evaluation has shown robust growth and is expected 
to increase in the years to come. Fueled by advances in proteomics 
and genomics as well as the ability to engineer and humanize 
monoclonal antibodies, protein-based therapeutics constitute ^500 
candidates currently in clinical trials (1). Several therapeutic pro- 
teins can be made in a prokaryotic expression system such as 
Escherichia coli (e.g., insulin); however, the majority of therapeutic 
proteins require additional posttranslational modifications to attain 
full biological function. N-glycosylation in particular is essential for 
proper folding, pharmacokinetic stability, and efficacy for a large 
number of proteins (2). Most therapeutically relevant glycopro- 
teins, including antibodies, are therefore expressed in mammalian 
cells. However, volumetric productivity, product heterogeneity, 
media cost, retroviral contamination, and the time required to 
generate stable cell lines are generally viewed as drawbacks of 
mammalian cell culture. 

Fungal protein-expression systems do not suffer from the same 
limitations, and protein titers of 14.8 and 35 g/liter have been 
reported for secreted heterologous proteins in yeast and the fila- 
mentous fungus Trichoderma reesei, respectively (3, 4). However, 
glycoproteins derived from fungal expression systems contain non- 
human N-glycans of the high mannose (Man) type, which are 
immunogenic in humans and thus of limited therapeutic value (5). 

Fungi and mammals share initial steps of protein N-glycosylation, 
which involves the site-specific transfer of (Glc)3-(Man)9- 
(GlcNAc)2 from the luminal side of the endoplasmic reticulum 
(ER) to the de novo synthesized protein by an oligosaccharyltrans- 



ferase complex. Subsequent trimming by glucosidases I and II and 
a specific ER-residing a-l,2-mannosidase leads to the formation of 
a (Man)8-(GlcNAc)2 structures (isomer ManSB) (Fig. 1), the 
M-glycan found on most glycoproteins leaving the ER. 

After the export of predominantly (Man)8-(GicNAc)2 con- 
taining glycoproteins to the Golgi, the pathways diverge notably 
between mammals and yeast (6). In the human Golgi a-I,2- 
mannosidases (lA-IC) remove Man to yield the (Manjs- 
(GlcNAc)2 structure, which forms the precursor for complex 
M-glycans (Fig. \A). These mannosidases are typically type II 
membrane proteins with an N-terminal cytosolic tail, a trans- 
membrane domain, a stem region, and a C-terminal catalytic 
domain (Fig. \B), Localization of these proteins, as with most 
enzymes involved in Golgi glycosylation, is mediated by the 
cytosolic tail, the transmembrane region, and the stem (7). 

In Saccharomyces cerevisiae, N-glycosylation has been studied 
extensively and, unlike mammalian N-glycan processing, involves 
the addition of numerous Man sugars throughout the entire 
Golgi, often leading to hypermannosylated N-glycan structures 
with >100 Man residues. This process is initiated in the early 
Golgi by an a-l,6-mannosyltransferase (Ochlp) that prefers 
(Man)8-(GlcNAc)2 as a substrate but is able to recognize various 
other Man oligomers with the notable exception of the human 
(Man)5-(GlcNAc)2 intermediate, which is not a substrate (8). 
After addition of this first a-l,6-Man by Ochlp, additional 
a-l,6-mannosyltransferases will extend the a-1,6 chain, which 
then becomes the substrate for medial- and trans-Golgi-residing 
a- 1,2- and of-l,3-mannosyltransf erases as well as phosphoman- 
nosyltransferases that add yet more Man sugars ito the growing 
N-glycan structure (9). In Pichia pastoris a very similar process 
occurs; however, hypermannosylation occurs less frequently and 
to a lower extent. In addition a-l,3-mannosyltransferase activity 
has not been found in this yeast, and N-glycans from P. pastoris 
do not have a-l,3-Man attached to the outer Man chain (10). 

Humanizing the glycosylation machinery of a yeast strain will 
require the (i) elimination of some endogenous glycosylation 
reactions and («) the recreation of the sequential nature of human 
glycosylation in the ER and Golgi. Although the first step involves 
the generation of gene knockouts (e.g., a-1,6- and/or a-1,3- 
mannosyltransferases), the second step requires the proper local- 
ization of active mannosidases, glycosyltransferases, and possibly 
nucleotide sugar transporters to specific organelles. Moreover the 
formation of certain sugar-nucleotide precursor pools such as 
CMP-sialic acid may have to be engineered into the yeast host. 
Much is known about the localization of endogenous proteins in the 
secretory pathway of S. cerevisiae and other yeasts; however, there 
is no reliable method to predict whether a Golgi protein from one 
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Fig. 1. {A) N-l inked glycosylation pathway in humans and R pastoris. Mns, 
a-1,2-mannosidase; MnT, mannosyltransferase; Ochip, the initiating 1,6- 
mannosyitransf erase. (B) Structure of typical type II membrane glycosidase or 
glycosyltransferase. 



organism will be properly localized in the Golgi of another organ- 
ism. It is also unknown whether artificial in-frame protein fusions 
consisting of a yeast localization sequence and a catalytic domain 
from a nonyeast source (e.g., glycosyltransferases or mannosidases) 
will (i) localize to the desired organelle and (ii) show sufficiently 
high activity in the targeted environment. Although proper local- 
ization is important, it is not sufficient. Because the activity of the 
catalytic domain has to be maintained in the environment in which 
it has been localized, additional aspects such as pH optima must be 
considered. For example, Chiba et al (11) found that localizing a 
mannosidase with a pH optimum of 5 in the ER of 5. cerevisiae 
results in only modest intracellular mannosidase activity. 

To overcome these major constraints we developed two tools: 
a combinatorial genetic library, generating hundreds of fusion 
constructs at a time, and a high-throughput screen that allows us 
to analyze large numbers of strains in parallel for their ability to 
modify N-glycans of recombinant reporter proteins. The com- 
binatorial genetic library consists of an array of different fusion- 
protein constructs, each of which contains a fungal cellular 
targeting sequence fused in frame to a catalytic domain (e.g., 
mannosidase). Each of the 608 generated mannosidase fusion 
constructs was tested individually for its ability to catalyze the 
trimming of higher Man structures to (Man)5-(GlcNAc)2. Those 
strains that were able to generate mostly (Man)5-(GlcNAc)2 on 
a secreted reporter protein were subjected first to a second 
screen to ensure that trimming occurred in vivo and then 



engineered further to generate GlcNAc-(Man)5-(GlcNAc)2 by 
screening a similar j3-l,2-A^-acetylgIucosaminyltransferase I 
(GnTI)/leader library. A more comprehensive article describing 
the characteristics of >600 leader/mannosidase and leader/ 
GnTI fusions is in preparation (B.-K.C, P.B., R.C.D., S.R.H 
A. Stadheim, H.L., R.G.M., J.H.N., S.W., and T.U.G., unpub- 
lished data); however, some of the most important findings are 
reported in this article. 

Here we report the re-engineering of the secretory pathway in the 
methylotrophic yeast P, pastoris. The engineered strain produces 
predominantly N-glycans that are intermediates of the human 
gl/cosylation pathway, essentially void of fungal features. Our 
results suggest that further implementation of the described com- 
binatorial library approach will allow for the engineering of yeast 
strains with increasingly human N-glycosylation. This article reports 
a genetically engineered yeast capable of producing a glycoprotein 
with a human-like hybrid N-glycosylation structure. 

Materials and Methods 

Strains, Culture Conditions, and Reagents. E. coll strains TOPIO or 
DH5of were used for recombinant DNA work. P, pastoris GSl 15 
{his4, Invitrogen) or JC308 {ura3, adel, arg4, his4, a gift from 
James M. Cregg, Keck Graduate Institute, Claremont, CA) were 
used for generation of yeast strains. Protein expression was 
carried out at room temperature in a 96-well-pIate format with 
buffered glycerol-complex medium (BMGY) consisting of 1% 
yeast extract, 2% peptone, 100 mM potassium phosphate buffer 
(pH 6.0), 1.34% yeast nitrogen base, 4 X 10-^% biotin, and 1% 
glycerol as a growth medium. The induction medium was 
buffered methanol-complex medium (BMMY) consisting of 
1.5% methanol instead of glycerol in BMGY. Minimal medium 
is 1.4% yeast nitrogen base, 2% dextrose, 1.5% agar, and 4 x 
10~^% biotin and amino acids supplemented as appropriate. 
Restriction and modification enzymes were from New England 
BioLabs. Oligonucleotides were obtained from the Dartmouth 
College Core facility (Hanover, NH) or Integrated DNA Tech- 
nologies (Coralville, lA). The enzymes, peptide //-glycosidase F, 
mannosidases, and oligosaccharides were obtained from Glyko 
(San Rafael, CA). Metal chelating HisBind resin was from 
Novagen. Lysate-clearing plates (96-well) were from Promega. 
Protein-binding 96-weIl plates were from Millipore. Salts and 
buffering agents were from Sigma. Matrix-assisted laser desorp- 
tion ionization (MALDI) matrices were from Aldrich. 

Cloning and Deletion of the P. pastoris 0CH1 Gene. The 1,215-bp 
ORF of the P. pastoris OCHl gene encoding a putative a-1,6- 
mannosyltransferase was amplified from P. pastoris genomic 
DNA (strain X-33, Invitrogen) by using the oligonucleotides 
5'-ATGGCGAAGGCAGATGGCAGT-3' and 5'-TTAGTC- 
CTTCCAACTTCCTTC-3', which were designed based on the 
P. pastoris OCHI sequence (12). Subsequently, 2,685 bp up- 
stream and 1,175 bp downstream of the ORF of the OC///gene 
were amplified from a P, pastoris genomic DNA library (gift 
from Judah Folkman, Harvard Medical School, Boston) by using 
the internal oligonucleotides 5'-ATGGCGAAGGCAGATG- 
GCAGT-3' and 5'-ACTGCCATCTGCCTTCGCCAT-3' in the 
OCHI gene with T3 (5'-AATTAACCCrCACTAAAGGG-3') 
and T7 (5'-GTAATACGACTCACTATAGGGC-3') in the 
backbone of the library bearing plasmid A ZAP II (Stratagene). 
The resulting 5,075-bp fragment was cloned into the pCR2.1- 
TOPO vector (Invitrogen) and designated pBK9. To create an 
ochl knockout strain containing multiple auxotrophic markers, 
100 iu,g of pJN329, a plasmid containing an ochl\:URA3 mutant 
allele (J.F.N., S.W., and T.U.G., unpublished data; see Fig. 2/4 
for details) was digested with Sfi\ and used to transform P. 
pastoris strain JC308 by electroporation. After incubation on 
defined medium lacking uracil for 10 days at room temperature, 
1,000 colonies were picked and restreaked. URA"^ clones that 
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Fig. 2. Knockout of OCHI in A pastoris. {A) The OCHI knockout pi asm id 
pJN329 contains the P. pastoris URA3 gene flanked by LacZ repeats and 2,878 
bp upstream (5') and 1,01 1 bp downstream (3') of the OCHI gene of R pastoris. 
(fi) I mm u nob lot of Ochip in P/cAi/'a wild type (JC308, lane 1)andoch7 mutant 
(BK64-1, lane 2). The same amount of cell-free extract (50 per lane) was 
used, andOchIp was detected by using an Ochip peptide antibody followed 
by ECL. (O The reporter protein K3 was expressed in Pichia wild type BK64 
(lane 1) and an ochi mutant, P. pastoris BK64-1 (lane 2). KB was purified by 
Ni-affinity chromatography, separated by SDS/PAGE (4-20% gradient) under 
reducing conditions, and visualized by silver staining. The same amount of K3 
(200 ng per lane) was loaded. 



were unable to grow at 3TC but grew at room temperature were 
subjected to colony PGR to test for the correct integration of the 
ochl::URA3 mutant allele. One clone that exhibited the ex- 
pected PGR pattern was designated YJN153. 

Reporter Protein-Expression Construct. The Kringle 3 domain of 
human plasminogen (K3) was used as a model protein. A DNA 
fragment encoding the K3 (gift from Nick Menhart, Illinois 
Institute of Technology, Ghicago) was amplified by using Pfu 
turbo polymerase (Stratagene) and cloned into EcoRl andZ^al 
sites of pPIGZoA (Invitrogen), resulting in a G-terminal 6-His 
tag. To improve the N-linked glycosylation efficiency of K3 (13), 
Pro-46 was replaced with Ser-46 by using site-directed muta- 
genesis. The resulting plasmid was designated pBK64. The 
correct sequence of the PGR construct was confirmed by DNA 
sequencing. 

Integration vectors and fusion constructs were generated 
based on the roll-in plasmids described by Lin Cereghino and 
coworkers (14); see details in Supporting Methods, which is 
published as supporting information on the PNAS web site, 
www.pnas.org. 

Generation of Yeast Strains. To create P, pastoris strains expressing 
the K3, plasmid pBK64 was transformed into strains GS115 and 
YJN153, and colonies were selected on YPD medium containing 
100 /xg/m! zeocin to create strains BK64 and BK64-1, respec- 
tively. Plasmid pPB103 was linearized with EcoNl and trans- 
formed into strain BK64-1, and colonies were selected on 
defined medium lacking adenine. One strain containing the 
Kluyveromyces lactis MNNl-l gene was designated PBPl. Plas- 
mids pBB27 and pBG4 were linearized with Sail and trans- 
formed into strain PBPl, and colonies were selected on defined 
medium lacking histidine. Strains that were confirmed to contain 
the Caenorhabditis elegans a-l,2-mannosidase IB BB27 and BG4 
fusion constructs were designated YJN188 and YJN168, respec- 
tively, Plasmid pNA15 was linearized withy4fl/II and transformed 
into strain YJN168, and colonies were selected on minimal 
medium without amino acids. One strain that was confirmed to 
contain the human GnTI gene fusion was designated YJN201. 

Western Blotting. Proteins were separated by 4-20% gradient 
SDS/PAGE according to Laemmli (15) and then electroblotted 
onto nitrocellulose membrane (Schleicher & Schuell) as de- 
scribed (16). P. pastoris Ochlp was detected by using an antibody 
raised against the peptide GQQLSSPKIDYDPLTL (Sigma- 
Genosys) with an EGL kit (Amersham Pharmacia). 



Protein Purification. K3 was purified from the medium by Ni- 
affinity chromatography by using a 96-well format on a Beckman 
BioMek 2000 laboratory robot. The robotic purification is an 
adaptation of the protocol provided by Novagen for their 
HisBind resin. 

Release of N-Linked Glycans. The glycans were released and sep- 
arated from the glycoproteins by a modification of a previously 
reported method (17). After the proteins were reduced and 
carboxymethylated and the membranes were blocked, the wells 
were washed three time with water. The protein was deglyco- 
sylated by the addition of 30 fx\ of 10 mM NH4HGO3 (pH 8.3) 
containing 1 milliunit of N-glycanase (Glyko). After 16 h at ST^G, 
the solution containing the glycans was removed by centrifuga- 
tion and evaporated to dryness. 

MALDI/Time-of-Flight (TOP) Mass Spectrometry. Molecular weights 
of the glycans were determined by using a Voyager DE PRO linear 
MALDI/TOF (Applied Biosciences) mass spectrometer with de- 
layed extraction. The dried glycans from each well were dissolved 
in 15 /il of water, and 0.5 ^1 was spotted on stainless-steel sample 
plates and mixed with 0.5 of S-DHB matrix (9 mg/ml of 
dihydroxybenzoic acid/1 mg/ml of 5-methoxysalicylic acid in 1:1 
water/acetonitrile/0.1% trifluoroacetic acid) and allowed to dry. 
Ions were generated by irradiation with a pulsed nitrogen laser (337 
nm) with a 4-ns pulse time. The instrument was operated in the 
delayed extraction mode with a 125 -ns delay and an accelerating 
voltage of 20 kV. The grid voltage was 93,00%, guide wire voltage 
was 0.1%, the internal pressure was <5 x 10"'' torr (1 torr = 133 
Pa), and the low mass gate was 875 Da. Spectra were generated 
from the sum of 100-200 laser pulses and acquired with a 500-N4Hz 
digitizer. (Man)5-(GlcNAc)2 oligosaccharide was used as an exter- 
nal molecular weight standard. All spectra were generated with the 
instrument in the positive-ion mode. 

Mannosidase Assays. Fluorescence-labeled (Man)8-(GlcNAc)2 
(0.5 /Ag) was added to 20 jllI of supernatant and incubated for 30 h 
at room temperature. After incubation the sample was analyzed 
by HPLG with an Econosil NH2 4.6 X 250-mm, 5-fxm bead, 
amino-bound silica column (Alltech, Avondale, PA). The flow 
rate was 1.0 ml/min for 40 min, and the column was maintained 
at 30°G. After eluting isocratically (68% A:32% B) for 3 min, a 
linear solvent gradient (68% A:32% B to 40% A:60% B) was 
used over 27 min to elute the glycans (18). Solvent A was 
acetonitrile, and solvent B was an aqueous solution of ammo- 
nium formate, 50 mM (pH 4.5). The column was equilibrated 
with solvent (68% A:32% B) for 20 min between runs. 

Results 

Generation of an Qc-1,6-Mannosy (transferase Deletion Mutant in P. 
pastoris. In S. cerevisiae the OCHI gene product (Ochlp) is an 
a-l,6-mannosyltransferase that initiates the outer-chain elonga- 
tion of N-linked glycans in the early Golgi. An ochl null mutant 
strain in S. cerevisiae shows no a-l,6-Man linkage to the core 
glycan structure and consequently lacks hyperglycosylation (19). 
To generate a deletion of the OCHI homolog in P. pastoris an 
ochl ::URA3 mutant allele was constructed and transformed into 
a ura3 strain of P. pastoris (JG308). Several ochl mutant strains 
were identified and confirmed by PGR and Western blotting 
(Fig. 2B). The P. pastoris ochl mutant strain exhibited temper- , 
ature sensitivity and increased flocculation similar to that ob-| 
served in S. cerevisiae (data not shown). 

Glycan structures of heterologous proteins expressed in P, 
pastoris are heterogeneous, mostly consisting of (Man)i()_i6- 
(GlcNAc)2 mannans with varying degrees of charged glycans (10, 
20). To monitor the effect of different engineering steps on the 
glycosylation of P. pastoris, a secreted form of K3 was used as a 
reporter protein. Hyperglycosylation observed by SDS/PAGE 
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analysis when K3 was expressed in a wild-type P. pastoris strain 
was eliminated in the ochl mutant strain (Fig. 2C). N-glycans 
released from secreted K3 were analyzed directly by MALDI/ 
TOF mass spectrometry. N-glycans of K3 expressed in a P, 
pastoris wild-type strain were found to have a molecular mass 
consistent with (Man)io_i6-(GlcNAc)2, confirming previous 
findings in this yeast (Fig. 2>A). By comparison, the N-glycans of 
K3 in an ochl mutant strain revealed predominantly (Man)8-i2- 
(GlcNAc)2, representing a noticeable shift to smaller glycans 
(Fig. 35) and a complete elimination of the smearing observed 
in the SDS/PAGE analysis (Fig. 2C). To rule out the possibility 
of protein-specific interactions a second reporter protein, full- 
length human IFN-j3, was used and revealed identical results 
(data not shown). 

Construction of ER/Golgi leader, a-1,2-IVIannosidase, and GnTI 
Libraries. To sequentially localize different mannosidases and Gn- 
TIs along the early secretory pathway of P. pastoris three separate 
gene libraries were designed. The first library (the leader library) 
contained DNA fragments encoding N-terminai peptides of known 
type II membrane proteins that either localize in the ER or Goigi 
of S. cerevisiae and P. pastoris. They include GIsl, Mnsl, Secl2, 
Mnn9, Vanl, Anpl, Hocl, and MnnlO, Mnnll from S. cerevisiae 
and Ochl and Secl2 from P. pastoris. The generation of the 
respective DNA constructs is exemplified by the construction of 
Mnsl, Mnn9, and MnnlO leaders in Materials and Methods. A 
second library contained catalytic domains of a-l,2-mannosidases 
from Homo sapiens, Mus musculuSy Aspergillus nidulans, C. elegans, 
Drosophila melanogaster, and Penicillium citrinium. The generation 
of the respective DNA constructs is exemplified by the construction 
of mannosidase IB from C. elegans in Materials and Methods. 
Finally, a third library contained catalytic domains of GnTI genes 
from//, sapiens, C. elegans, Xenopus laevis, andD. melanogaster. All 
libraries were designed in a way that any combination of a leader 
construct and a catalytic domain created a gene encoding a 
chimeric fusion protein. 

Each leader fragment was represented in three lengths. The 
short form encoded the N-terminal cytoplasmic tail and the 
transmembrane domain. The long form encoded additional 
residues containing the complete stem region up to the respec- 
tive catalytic domain, which was determined by sequence ho- 
mology to known catalytically active fragments of such enzymes. 
The medium form was an intermediate version containing 
approximately half of the stem region in addition to the sequence 
encoded in the short form. Catalytic domains were selected to 
cover a wide range of pH optima as determined from literature 
data (e.g., P. citrinium andM. musculus) and temperature optima 
by selecting domains from organisms that exist at different 
temperatures (e.g., C elegans and K sapiens). Each catalytic 
domain was represented in several lengths and generally lacked 
the native N-terminal cytosolic and transmembrane domain. 
Some of the catalytic domains were selected solely on the basis 



Fig. 3. Positive-ion MALDI/TOF mass spectra of N-Iinked glycans released 
from K3. K3 was produced In P. pastor/s strains BK64, BK64-1, YJN168, YJN188, 
and YJN20 1 and purified from culture supernatants by Nl-aff inity chromatog- 
raphy. The glycans were released from K3 by peptide W-glycosidase F treat- 
ment. The released N-linked glycans were analyzed by MALDI/TOF mass 
spectrometry, typically appearing as the sodium or potassium adducts. {A) 
BK61, wild-type strain of P. pastoris expressing K3. (fi) BK64-1, ochl deletion 
expressing K3. (O YJN168, ochl deletion expressing K3, K. lactis UDP-GlcNAc 
transporter, and C elegans a-1,2-mannosidase IB fused to MNSI. (O) YJN18B, 
ochl deletion expressing K3, K. lactis UDP-GlcN Ac transporter, and C. elegans 
a-1,2-mannosidase IB fused to MNN10. (£) YJN201, ochl deletion expressing 
K3, K. lactis UDP-GlcN Ac transporter, C. elegans a-1,2-mannosidase IB fused to 
/W/VS7, and human GnTI fused to MNN9. {F) YJN201 after 0-W-acetyl hex- 
osaminidase treatment. M, Man. 
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Table 1. Relative amount of (Man)5 on secreted K3 

Amount of {Man)5 on secreted K3, 
% of total glycans 



Number of constructs {%) 



ND* 

0-10 

10-20 

20-40 

40-60 

>60 

Total 



19(3.1) 
341 (56.1) 

50 (8.2) 
75 (12.3) 
72 (11.8) 

51 (8.4)+ 
608(100) 



Six hundred and eight different strains of P. pastohs (oc/) 7) were generated 
by transforming them with a single construct of a combinatorial genetic 
library that was generated by fusing 19 a-1,2-mannosidase catalytic domains 
to 32 fungal ER and cis-Golgi leaders. 

♦Several fusion constructs were not tested because the corresponding plas- 
mids could not be propagated in £ coli before transformation into P. 
pastoris. 

^Clones with the highest degree of (Man)5 trimming (30/51) were analyzed 
further for mannosldase activity in the supernatant of the medium. The 
majority (28/30) displayed detectable mannosidase activity In the superna- 
tant (e.g. Fig. 4B). Only two constructs displayed high (Man)5 levels while 
lacking mannosidase activity in the medium (e.g. Fig. 40- 



of sequence homology to other known a-l,2-niannosidases (e.g. 
C elegans) or GnTIs and had not been characterized previously. 

After screening the fusion libraries chimeric constructs were 
identified that displayed a high degree of Man-trimming activity 
and UDP-GlcNAc transfer activity on the reporter protein K3. 
A detailed analysis of the characteristics of the fusion libraries 
will be published elsewhere (B.-K.C., P.B., R.C.D., S.R.H., 
A. Stadheim, H.L., R.G.M., J.H.N., S.W., and T.U.G., unpub- 
lished data). 

Expression of a-1,2-Mannosidase Fusion Constructs in a P. pastoris 
ochi Mutant Strain. After screening a library of 608 leader/a-1,2- 
mannosidase fusions targeted to the ER and early Golgi, several 
clones were identified that produced N-glycans consistent with 
a mass of (Man)5-(GlcNAc)2. Specifically, a putative C elegans 
homolog of known a-l,2-mannosidases showed a high degree of 
trimming to (Man)5-(GlcNAc)2 when fused to both the S. 
cerevisiae MNSl and MNNIO leader-encoding fragments. Al- 
though these constructs and several others resulted in glycans, 
which were 70-80% or more (Man)5-(GlcNAc)2 (Fig. 3 C and 
D), >56% of the fusions resulted in <10% (Man)5-(GlcNAc)2 
(Table 1). These data clearly emphasize the importance of 
choosing the proper combination of (i) a localization sequence 
and («) an a-l,2-mannosidase catalytic domain of the proper 
length. Fungal a-l,2-mannosidases with acidic pH optima (e.g., 
P. citrinium and/i. nidulans), when expressed as fusions with the 
leader library, generally resulted in low (Man)5-(GlcNAc)2 yields 
(data not shown) consistent with previous findings (11, 21). 

Because previous researchers have found that trimming to 
(Man)5-(GlcNAc)2 was often accompanied by leakage of man- 
nosidase into the medium (22), we further investigated whether 
Man trimming occurred in vivo, in the Golgi, or ex vivo after 
secretion of the protein into the medium. To determine the 
extent of mannosidase activity in the medium, 2-aminobenz- 
amide-labeled (Man)8-(GlcNAc)2 was used to assay the culture 
supernatant. Many of the efficient (Man)5-(GlcNAc)2-producing 
constructs displayed a high degree of mannosidase activity in the 
supernatant, suggesting that at least some of the observed 
(Man)5-(GlcNAc)2 structures were produced ex vivo. However, 
by applying a double screen we were able to identify specific 
chimeric fusions that were entirely retained intracellularly while 
at the same time displaying high in vivo a-l,2-mannosidase 
activity (Fig. 4C). 
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Fig. 4. Extracellular activity of C e/egans ot-1,2-mannosidase IB. 2-Amino- 
benzamide-labeled (Man)8-(GlcNAc)2 was Incubated with culture superna- 
tants at 25''C for 30 h and analyzed by HPLC. {A) (Man)8-(GlcNAc)2 standard in 
BMMY medium (no cells), (fl) Supernatant from YJN188. (O Supernatant from 
YJN168. M, Man. 



Expression of GnTI Fusion Constructs in a P. pastoris ochi Mutant 
Strain Containing an Active a-1,2-Manno5idase. Further humaniza- 
tion of (Man)5-(GlcNAc)2 glycan structures involves the subse- 
quent in vivo conversion of (Man)5-(GlcNAc)2 to GlcNAc- 
(Man)5-(GlcNAc)2. This step requires UDP-GIcNAc as a 
substrate and involves the transfer of GlcNAc to (Man)5- 
(GlcNAc)2 by GnTI (Fig. 1). To ensure sufficient levels of 
UDP-GlcNAc in the Golgi we cloned the UDP-GlcNAc trans- 
porter from K lactis into an ochi mutant strain, which also 
displays high in vivo mannosidase activity, 

A leader/GnTI fusion library containing 67 constructs was 
screened. Among several active fusion constructs, one consisting 
of human GnTI and a leader sequence from S, cerevisiae MNN9 
was particularly active, and the corresponding strains yielded 
K3-containing glycans with a mass consistent with GlcNAc- 
(Man)5-(GIcNAc)2 almost exclusively (Fig. 3£). Moreover, these 
glycans were converted completely to (Man)5-(GlcNAc)2 by in 
vitro //-acetylhexosaminidase digestion, further indicating that 
this strain secretes protein almost uniformly modified with 
glycans of the structure GlcNAc- (Man)5-(GlcNAc)2, a well 
described human glycosylation intermediate (Fig. 3F). Impor- 
tantly, GnTI activity was assayed in the culture medium of this 
strain and found to be absent (data not shown). Finally, to 
determine whether the UDP-GlcNAc transporter was required 
to provide sufficient amounts of substrate for GnTI, a strain 
producing (Man)5-(GlcNAc)2 was transformed with the same 
GnTI construct in the absence of the transporter. Although some 
GlcNAc-(Man)5-(GlcNAc)2 was observed from the purified K3, 
a large percentage of glycans consistent with (Man)5-(GlcNAc)2 
remained, indicating that GlcNAc transfer was less efficient than 
in the strain containing the transporter (data not shown). This 
result confirms the purported necessity of a UDP-GlcNAc 
transporter for efficient GlcNAc transfer originally found in IC 
lactis (23). 

Discussion 

P, pastoris is one of several yeasts capable of high-level production 
of heterologous glycoproteins (24), As with any other fungal 
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protein-expression systems, the heterologous glycoprotein is glyco- 
sylated in a fungal-like fashion generally involving the addition of 
a-1,6- and a-l,2-linked Man and mannosylphosphate to the 
(Man)8-(GlcNAc)2 core (Fig. 1; ref. 25). This yeast-type glycosyl- 
ation pattern is recognized by the human immune system, which 
renders the underlying protein unfit for therapeutic use. In view of 
the current shortage of efficient glycoprotein-expression systems we 
have focused our efforts on engineering commercially relevant 
yeast and filamentous fungi to produce N-glycans with human-like 
glycosylation structures. In this study we report the construction of 
a strain that produces high levels of a reporter protein modified with 
a human glycosylation intermediate in the P. pastoris system. 

The first step in attaining complex human-type glycans in a 
fungal system is to eliminate yeast-type glycosylation. To accom- 
plish this, the P. pastoris OCHl gene was deleted in a strain 
secreting the reporter protein K3. Yeast-type hyperglycosylation 
was abrogated in this strain as observed by silver staining of 
secreted K3 (Fig. 2C), and when released N-glycans were 
analyzed by MALDI/TOF, a general trend toward smaller 
structures was observed (Fig. 3B). 

Once an appropriate core high-Man glycan is obtained, the 
next immediate step in the conversion to human-type N-glycans 
involves the functional expression and localization of an a-1,2- 
mannosidase. This enzyme will trim the (Man)8-(GlcNAc)2 core 
structure to (Man)5-(GlcNAc)2 and thereby generate the struc- 
ture that is capable of receiving the GlcNAc that initiates the 
formation of hybrid N-glycans. A very similar approach was 
taken in a triple mutant ochl mnnl mnn4 strain of 5. cerevisiae 
(11). ER localization of a fungal a-l,2-mannosidase (from 
Aspergillus saitoi) was accomplished by adding the tetrapeptide 
HDEL as an ER retrieval tag to the C terminus of the gene. By 
using the S. cerevisiae GAPDH promoter and a multicopy 
number plasmid, mannosidase activity was detectable in cell-free 
extracts; however, only 27% of the N-glycans of an endogenous 
marker protein (carboxypeptidase Y) were trimmed from 
(Man)8-(GIcNAc)2 to (Man)5-(GlcNAc)2 in vivo. Although not 
entirely successful, the pioneering work of Chiba et al (11) 
demonstrates that N-glycans from S. cerevisiae can be modified 
substantially by engineering glycosylation pathways. Here the 
localization of catalytic domains to the secretory pathway of the 
related yeast P. pastoris using yeast type II membrane protein 
leader domains is demonstrated. The benefit of a large library of 
leader domains along with an equally diverse library of catalytic 
domains allowed for the selection of the most active fusion 
constructs from a pool of >600 candidates, many of which were 
marginally active or not at all. Despite the lack of success of many 
constructs, a few particularly active enzyme fusions were able to 
trim the core (Man)8_i2-(GlcNAc)2 glycans observed in the ochl 
mutant strain to (Man)5-(GlcNAc)2. Two of these constructs 
described here, which are fusions of a putative C. elegans 
a-l,2-mannosidase with different yeast type II leader domains, 
are able to do so at high efficiency (>75%). 
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The correct localization of enzymes involved in glycosylation 
is critical to allow for the sequential glycan modifications as 
glycoproteins proceed through the secretory pathway. However, 
one additional concern has come from another previous attempt 
to engineer glycans in P. pastoris, which was undertaken with an 
a-l,2-mannosidase from T. reesei (22). In this study immunof lu- 
orescent microscopy was used to demonstrate that a mannosi- 
dase-myc-HDEL fusion localized primarily in the ER of P. 
pastoris; however, leakage into the medium was also observed by 
Western blotting. Because the secreted glycoprotein of interest 
will be in the supernatant for many hours, the cosecretion of 
mannosidases into the medium is of concern because it can be 
misinterpreted as in vivo activity. It is important to generate 
(Man)5-(GlcNAc)2 structures in vivo, and early in the secretory 
pathway, if subsequent conversion to complex glycans is to be 
achieved. Although it is well established that the £/^2-based 
retrieval system is leaky and retention of the HDEL-tagged 
mannosidase in the secretory pathway cannot be ensured (26), 
the same concern is emphasized further by the striking differ- 
ence demonstrated here between the two different C elegans 
a-l,2-mannosidase IB fusion constructs. Thus, the use of a series 
of different fusion constructs with many differentially localized 
type II domains has allowed us to screen for chimera with activity 
that is completely in vivo. 

After efficient trimming of the core glycan to (Man)5- 
(GlcNAc)2 the next step in the conversion of high Man-type 
glycans to hybrid- and complex- type glycans involves the expres- 
sion and localization of the enzyme GnTI. Here again a library 
of GnTI catalytic domains was used and allowed for the screen- 
ing for and selection of several particularly active GnTI fusion 
constructs for further study. One particularly active fusion using 
the human GnTI catalytic domain shown here converts the 
(Man)5-(GlcNAc)2 substrate to the desired GlcNAc-(Man)5- 
(GlcNAc)2 product almost quantitatively, an activity that was 
shown to be completely in vivo. Furthermore, the demonstrated 
activity of GnTI on the (Man)5-(GlcNAc)2 substrate is the best 
evidence that the a-l,2-mannosidase activity is indeed occurring 
in the secretory pathway. This is a demonstration of a high-level 
hybrid N-glycan modification of a secreted protein in yeast and 
represents a significant step toward the ability to express fully 
human glycoproteins in yeast. Although the generated structures 
are expected to be nonimmunogenic in humans, additional Man 
removal (i.e., the removal of 1,6- and 1,3-Man from the tri- 
mannose core) and further addition of /3-1,2-GlcNAc will be 
required to generate complex N-glycans of therapeutic utility 
(e.g., for the production of monoclonal antibodies). 
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